﻿ An Incremental Discourse Parser Architecture Dan Cristea “Alexandru Ioan Cuza” University of Ia•i, Faculty of Computer Science 16, Berthelot St , 6600 – Ia•i, Romania dcristea@infoiasi ro Abstract We present a discourse parsing architecture based on an incremental approach and aimed at building a rhetorical structure of a free text Vein expressions computed on the developing structure help to restrict the domains of referential accessibility on which resolution of anaphora is performed The parsing process is guided by cohesion and coherence constraints 1 Introduction Incremental discourse parsing is particularly challenging to systems aimed at doing automatic text processing because of its resemblance with the linear processing of texts done by humans Although humans process texts linearly, there are strong reasons to believe that they build structures which are not linear but hierarchical and that they use these structures to further resolve co-references and to build interpretations Theories like Attentional State Theory , and Rhetorical Structure Theory argued in favour of a hierarchical structure of discourse There is debate whether the discourse structure must resonate with the resolution of anaphora (see for instance ) We believe there is tight relationship between discourse structure and references In a system that integrates the expertise of several modules (syntactic, semantic, common sense inference, discourse planning, anaphora resolution, cue- words and temporal) in order to build incrementally the meaning of free texts was presented The behaviour is based on the assumption that the performance of the system could be affected but is not vitally dependent on any of the contributing expert modules The more one knows the more accurate the representation could be If less is known, less is obtained, but the system can still survive In this paper we describe a configuration that integrates a part of speech tagger, a cue-expert module, a discourse parser and a reference resolution module Section 2 presents the theoretical clues the approach is anchored in: Veins Theory and expectation-based incremental parsing, section 3 explains in detail the behaviour of the system on a particular fragment of English text and section 4 compares the proposed architecture with other work 2 Veins Theory Veins Theory uses a representation of the discourse structure as a binary tree, close to the representation in Rhetorical Structure Theory (RST) Terminal nodes in the tree represent discourse units and non-terminal nodes represent continuous text spans and are labelled with rhetorical relations Usually a label uniquely identifies a unit A polarity is established among the children of a relation, which identifies at least one node, the nucleus, considered essential for the writer’s purpose; non-nuclear nodes, which include spans of text that increase understanding but are not essential to the writer’s purpose are called satellites VT operates with two concepts: head and vein, which are expressions of terminal labels (discourse unit labels) attached to each node in the discourse structure Head expressions (equivalent to Marcu's promotion sets ) are computed bottom-up: < the head of a terminal node is its label; < the head of a non-terminal node is the concatenation of the heads of its nuclear children Veins expressions are sub-sequences of the sequence of unit labels making up the discourse and are computed top-down: < the vein expression of the root is its head expression; < for each nuclear node whose parent node has vein v, the vein expression is: if the node has a left non-nuclear sibling with head h, then seq(mark(h), v), where mark(x) is a function that takes a string of symbols x and returns each symbol in x marked in some way (e g with parentheses) and seq(x, y) is a sequencing function that computes that permutation of x concatenated with y given by the left to right reading of the sequence of labels in x and y on the terminal frontier of the tree otherwise (if the node has no left sibling or its left sibling is nuclear), v; < for each non-nuclear node of head h whose parent node has vein v, the vein expression is: if the node is the left child of its parent, then seq(h,v) otherwise, seq(h, simpl(v)), where simpl(x) is a function that eliminates all marked symbols from its argument, if they exist The definition for the vein expression of a node is intended to put in evidence a list of units of the discourse that make up a comprehensible context for the span of text covered by the node In particular, the vein expression of the root node gives a summary of the whole discourse, while the vein expression of a terminal node – a unit of the discourse, reveals the minimum context around that unit that makes the unit understandable The particular interest given to left satellites resides from empirical observations that they are accessible to their immediate to the right nuclei, but inaccessible to subsequent satellites The domain of referential accessibility (DRA) of a unit is defined as the string of unit labels appearing in its vein expression and prefixing that unit label itself Ex 1 1 John sold his bicycle 2 although Bill would have wanted it 3 He obtained a good price for it, 4 which Bill could have not afforded 5 Therefore he decided to use the money for going in a trip The RST structure of Ex 1, as put in evidence by the cue-words although, which, therefore, must be one of those evidenced in Fig 1 VT makes two important claims The first regards discourse cohesion, the second discourse coherence NARATION CONCLUSION H = 1 3 H = 1 3 = 1 3 V = 1 3 V CONCESSION CONCLUSION NARATION UNIT 5 H = 1 H = 3 H = 1 3 H = 5 V = 1 3 = 1 3 V = 1 3 V = 1 3 5 V DRA = 1 3 5 UNIT 1 UNIT 2 ELABORATION UNIT 5 CONCESSION ELABORATION H = 1 H = 2 H = 3 H = 5 H = 1 H = 3 V = 1 3 = 1 2 3 V = 1 3 V = 1 3 5 V = 1 3 V = 1 3 V DRA = 1 = 1 2 DRA = 1 3 5 DRA UNIT 3 UNIT 4 UNIT 1 UNIT 2 UNIT 3 UNIT 4 H = 3 H = 4 H = 1 H = 2 H = 3 H = 4 V = 1 3 = 1 3 4 V = 1 3 V = 1 2 3 V = 1 3 V = 1 3 4 V DRA = 1 3 DRA = 1 3 4 DRA = 1 DRA = 1 2 DRA = 1 3 DRA = 1 3 4 Fig 1 Possible RST structures for Ex 1 2 1 VT and Discourse Cohesion Veins Theory claims that references from a given unit are possible mainly in its domain of referential accessibility By restraining the domains the antecedents of anaphors are to be looked for, VT makes the existing anaphora resolution techniques more effective and more efficient The model we have studied for experimenting VT-driven anaphora resolution is based on a three tiered representation and a two steps process The first layer is that of the surface text Referring expressions (REs) belong to this layer The second layer is the one of the morpho-semantic restrictions This layer is populated with feature structures that display sets of morphological and semantic feature-value pairs To each RE on the textual layer correspond one or more representations on the morpho-semantic layer We say that the morpho-semantic signs are projected there from the textual layer The third layer is the situational layer, also populated with feature structures, but where all references are resolved There is only one representation for each discourse entity on this layer We say that the entities on the situational layer are evoked there from the restrictions layer (Fig 2 displays the three layers for two co-referencing REs) Let’s notice that the composition of the projects and evokes relations yields the realises relation of centering As such, the world of REs can be seen as a simplification of the overall dynamic model of language interpretation, where we retain only REs and representations emerged from them e text layer ………………………… a ………………… b Th RE a projects RE b projects fsb The restrictions layer …………………… fsa …………………… fsa evokes centera fs evokes center a b The situations layer ……………… centera … Fig 2 The three-layer representation for two co-referencing REs Different theories of anaphora resolution use specific evoking mechanisms For instance, the resolution of pronominal anaphors could impose the agreement in person, number, gender and a semantic feature (human or non-human) The evoking mechanism of noun phrase anaphors could be more complicated, as they could involve also synonymy and hipernymy relations as well as navigation processes in a WordNet- like conceptual hierarchy VT sees references in conjunction with discourse structure The following types of references are put in evidence: Direct references If A and B are units in this textual order, A belongs to the DRA of B and aDA is linearly the most recent (to B) RE that realizes that same center as bDB, we say that b directly co-refers the center evoked by a (see Fig 3) If A and B are units in this order, A belongs in the DRA of B and aDA is linearly the most recent RE (to B) that realises a role of bDB, we say that b functionally directly co-refers the center realised by a To put in evidence the DRAs of units, in the following graphical representations the units and the veins they belong to are drawn above REs Also, to simplify notation, only the first and the second layers are shown A B A B The text layer ………… a ………………… b … … a ………………… b … centera centerb The situations layer … centera …………………………… ……………… role a direct co-reference b functional direct co-reference Fig 3 Direct co-reference: the linearly most recent RE that realises the same center of an RE is shown by a dashed arrow and the realisation relation by a full arrow Indirect references If A, B and C are units in this order, bD B is linearly the most recent (to C) RE that realises the same center as cDC, B is not on the DRA of C, A is linearly the most recent (to B) unit that is both on the DRA of B and of C, and it contains a RE aDA such that bDB realises the same center as a, we say that c indirectly co-refers the center realised by a (see ) A similar definition applies for indirect functional references ts and DRAs …………………… … A B C uni text ………………………… ………… a ……… ……… b c situations ………… ………………… centera Fig 4 Indirect co-reference In other words, an indirect reference occurs when the DRA expression of the anaphor’s unit intersects the one of the most recent antecedent, there is a unit on this common segment that realises the same center as the one referred by the anaphor and this unit is not linearly the most recent to the anaphor In still another words, an indirect reference occurs when the chain of units of backward looking co-references intersects the DRA of anaphor’s unit in a unit that is not the most recent to the anaphor’s unit Anything that does not obey the direct or indirect references are inferential references This happens when there is no intersection between the backward looking chain of co-references’ units and the DRA of the anaphor's unit A particular category of inference references is something that we would call pragmatic references, or pseudo references These are REs that practically do not refer back in the discourse although an entity identically realised was already introduced If she says that she waked up this morning at the same time with the sun and, after a few sentences, she says that no cloud covers the sun, it is questionably that in this last sentence she refers back to her first mention of the celestial object One can understand the last sentence without making the connection with the first one We all know about sun and anytime we are said about it we know what the concept is because it is outside the text, it is the same to all of us, it is pragmatic knowledge But if we make a story about the sun in which it is passing through different events, from its raising to its dawning, than here we have references Although everybody knows what the sun is, one can understand the story only linking the textual entities mentioned in sequence to the same situational entity Because they are told about in the same story, doing things or participating in connected events, it is important that one has the same mental representation about all the mentioned suns, and not only knowing them as having identical functionality The difference between cases as the ones mentioned could be reflected, although not necessarily, by considering the references in connection with the veins structure It is most plausible that references in the first example could appear as inferential references, while references in the second example could be recovered as direct or indirect references Inferential references seem to minimize the importance of the domain of referential accessibility, as defined in VT, because references can now “escape” from the domain Does anymore the domain of accessibility have any significance? Is it an artificial invention or it is defended by a natural characteristic of the manner humans process texts? We claim that there are two significantly distinct types of anaphora resolution processes: evocative (or associative) and post-evocative (or inferential) The evocative resolution processes are due to direct and indirect references They are based on associations, which are processes of pattern-matching on feature structures decorated with morpho-semantic attributes They are performed between a feature structure projected by the anaphor RE and a center that exists already in the DRA of the unit the anaphor belongs to As such, they are performed between layer two and layer three feature structures These are fast processes, direct ones being faster and more frequent than indirect ones On the other hand, the post-evocative processes are inferential processes that are developed in memory, based on the knowledge accumulated by the preceding discourse, or based on the cultural knowledge the subject owns We believe these inferences swings the semantic space in an order that is also dictated by discourse structure Eventually, the target entity can be found based on a pattern-matching process between the projected structure of the anaphor and the center of the antecedent They are slow (compel to more inference load), necessitate more powerful referencing means (like proper nouns) and are less frequent processes Fig 5 shows different types of anaphoric relations for the discourse of Ex 1 ojection restrictions evocative and situations a nd unitstheir REs pr DRAs processes post-evocative processes pos = proper nounJohn type = person type = person John name = John name = John gender = masc nder = masc ge num=sgnum = sg 1 pos = def noun bicycle1 type = bicycle his bicycle owner = [John] type = bicycle ner = [John] = ¬human owgender gender = ¬human number=sgnumber = sg pos = proper noun type = person Bill Bill name = Bill type = person name = Bill gender = masc 2 =sggender = masc numnum = sg pos = pron it gender = ¬human ber = sg num pos = pron direct references he type = human nder = masc ge num = sg price1 pos = undef noun type = price 3 price type = price numgender = ¬human = sg number = sg pos = pron it gender = ¬human ber = sg num which pos = pron indirect reference 4 pos = proper noun type = person Bill name = Bill gender = masc inferential reference num=sg pos = pron he type = human nder = masc ge num = sg mon ey1 pos = def noun type = money 5 the money type = money number = pl num = pl trip1 a trip pos = undef noun type = trip number = sg number = sg Fig 5 Anaphoric relations in connection with discourse structure on Ex 1 2 2 VT and Discourse Coherence The second claim of VT extends the classical Centering Theory (CT) at global level As defined, centering is a theory of local coherence CT defines a set of transition types for discourse , : CONTINUATION which is easier to process than RETAINING, which is easier than SMOOTH SHIFTING, which is easier than ABRUPT SHIFTING In a smoothness index was introduced to compare different discourse structures and interpretations The smoothness index for a transition is based on the CT natural rating of transitions: CONTINUATION = 4, RETAINING = 3, SMOOTH SHIFTING = 2, ABRUPT SHIFTING =1, no backward-looking center (Cb) = 0 Then the scores for each transition in the entire segment are summed up, and the result is divided by the number of transitions in the segment An index of the overall coherence of the segment, called global smoothness score, is such obtained VT claims that the global smoothness score of a discourse when computed following the neighbouring metric given by vein expressions, following VT, is at least as high as the score computed following the adjacency metric recommended by CT, when extended to the whole discourse By this, VT claims that long-distance transitions computed over return- pops, using vein expressions, are systematically smoother than accidental transitions at segment boundaries We use the global smoothness score as criteria of acceptability of a discourse structure Table 1 and Table 2 exemplify transitions and global scores in CT and, respectively, VT for Ex 1 A comparison between the two tables clearly show the better global score obtained in the hierarchical case The three sequences of units form coherent argumentation lines: 1 John sold his bicycle 2 although Bill would have wanted it 1 John sold his bicycle 3 He obtained a good price for it, 4 which Bill could have not afforded 1 John sold his bicycle 3 He obtained a good price for it, 5 Therefore he decided to use the money for going in a trip and, on each of them, the discourse is smoother then in the original overall discourse Table 1 Transitions and scores on a linear adjacency metric (J = [John], b = [John's bicycle], B = [Bill], p = [price], m = [the money], t = [a trip]) 1 2 3 4 5 CF J, b B, b J, p, b p, B J, m, t CB J b b p - Trans ASH RET SSH no Cb Score 1 3 2 0 Global 6/4 = 1 5 Table 2 Transitions and scores on a hierarchical adjacency metric (same notations as in Table 1) Only one transition is computed for each unit (excepting the first) 1 2 1 3 4 1 3 5 CF J, b B, b J, b J, p, b p, B J, b J, p J, m, t CB J b J J p J J J Trans ASH CON SSH CON CON Score 1 4 2 4 Global 11/4 = 2 75 2 3 Incremental Discourse Parsing Expectation-driven incremental discourse parsing applies to the processing of discourse an idea of incremental parsing developed initially for syntactic analysis within the LTAG framework Two operations on trees, adjunction and substitution, add at each step a small piece of structure to the current developed tree (CDT) Adjoining adds an auxiliary tree with at least one material (discourse unit) node Substitution unifies the root of a substitution structure with an empty node (denoting an expectation) of a CDT These operations can be performed only on the generalised right frontier of the CDT, equal to the right frontier if CDT is expectation nodes free, or the right frontier of the subtree rooted by the left sibling of the inner- most expectation node, otherwise 3 A Mixed Approach in Discourse Parsing Using VT as a guiding theory in an incremental discourse parsing approach is based on the assumption that the tree resulted after the parsing process, among the family of all possible RST-like discourse trees composed with rhetorical relations induced by cue-words of the text, is the one which manifests the more natural overall references over the discourse structure and the smoothest overall CT transitions on veins The computation of centering transitions is based heavily on the resolution of anaphoric expressions which are looked for within the domains of accessibility given by the vein expressions in the developing tree In what follows we investigate the behaviour of a system with a more restricted architecture than the one presented in From the set of optional knowledge sources we retain only four: a part-of-speech tagger (PosTag) – which provides morphological markers for each word, a cue-words expert (CueExp) – which splits the text in discourse units1 and provides hints with respect to identification of rhetorical relations and their place of insertion based on a collection of cue-words heuristics, a reference resolution expert (RefExp) – aiming to solve inter-unit pronominal and functional anaphora, and an incremental discourse parser (DiscPar) – which builds the associated discourse tree(s) 2 On the text in Ex 1 the incremental parser performs as shown in Fig 6 The first step builds the trivial tree made of only one unit As this unit initialises the discourse, RefExp will introduce two semantic entities (centers) corresponding to John and his bicycle, as in Fig 5 1 Minimally a clause, maximally a dot-to-dot compound sentence 2 To make the text shorter, some details are skipped Based on the cue-word although, CueExp proposes for the auxiliary tree of the second unit a CONCESSION relation and the discourse parser adjoins it in the unique node of the right frontier – unit 1 (in Fig 6 nodes of the right frontier are numbered in sequence bottom-up - 0 being the lowest right one) RefExp will find that the projected FS of Bill doesn't match any of the centers [John] and [bicycle1], both placed on the vein of unit 2, the only existing FS on the semantic layer at the moment, and this will result in introducing a new entity, [Bill] As for it, it will match [bicycle1] Step 1 Step 2 Step 3 1CDT:CONCESSION UNKNOWN 1 0 2*3* adjoining in node 0 adjoining in node 1 in node 0 of CDT of CDT adjoining 22 01CONCESSIONUNKNOWN CDT3: CDT3: : CONCESSION 1 CDT22 1 4 1CONCESSIONUNKNOWN 3Step 1 1 0 2 0 ELABORATION 3221 0 * 4 adjoining in node 0 adjoining in node 0 of CDT0 of CDT1 33 Go on with tree CDT1 41 0 : CDT4: CONCESSION CDT4UNKNOWN 21 3 1 UNKNOWN CONCESSION ELABORATION 1 2 2 ELABORATION 1 2 3 4 0 Step 5 1 CONCLUSION 3 4 0 * 5 adjoining in node 0 adjoining in node 1 adjoining in node 2 111 of CDT4 of CDT4 of CDT4 012 : CONCLUSION CDT5: UNKNOWN CDT5: UNKNOWN CDT5 CONCESSION ELABORATION CONCESSION CONCLUSION UNKNOWN 5 1 1 CONCESSION ELABORATION 2 3 CONCLUSION 2 ELABORATION 5 1 2 3 4 4 5 3 4 Fig 6 Incremental tree building for the discourse in Ex 1 There is no cue word in unit 3 that could hint on the type of relation, so CueExp proposes an UNKNOWN relation as root of the auxiliary tree At step 3 there are two places (marked 0 and 1) on the right frontier of the CDT where this auxiliary tree 2 could be adjoined by DiscPar Adjoining at node 0 would raise CDT0 such that 3 vein(3) = 1 2 3, while adjoining at node 1 yields CDT1 where vein(3) = 1 3 RefExp is 3 trying to resolve the anaphors using the DRAs given by VT on the already developed tree As the domain of unit 3 on CDT1 is 1 3, the REs he, price and it of this unit will 3 be matched against semantic entities introduced by REs of units 3 and 1 (in this order) and will, therefore, be found to be [John], [price1] (no match, newly introduced) and [bicycle1], respectively Note that the pronoun he will not be erroneously linked to the entity [Bill] with whom it resonate and which is linearly more recent In the variant of CDT0 the domain of unit 3 is 1 2 3 and both [John] and [Bill] are candidates for being 3 referents of he, while it could refer the same [bicycle1] as before When he is chosen to evoke [John] the centering transition from 2 to 3 is ABRUPT SHIFTING while if he would be [Bill], the transition is CONTINUING Therefore on CDT0 he in 3 is more 3 plausible to be [Bill] than [John] (obviously an erroneous choice), while in CDT1 he 3 is unequivocally [John] At this moment the parser has two structures that will continue to be developed in parallel until one is significantly better scored than the other At step 4 CueExp founds the cue word which to be an indication that this unit is an 0 ELABORATION of the preceding one Therefore on CDT3 only one place of the three on the right frontier is found of interest, while on CDT1 – only one of the two The 3 resulting trees are CDT0 that gives vein(4) = 1 2 3 4 and CDT1 with vein(4) = 1 3 4 44 The variant when he in unit 3 is bound to [Bill] would result in the case of CDT0 to 4 the extremely low probable reference (see ) from the proper noun Bill in unit 4 to the entity [Bill] realised by a pronoun in unit 3 This makes that at this moment, of the two possible variants of CDT0, to be retained the one that indicated he in 3 being 4 bound to [John] and Bill in 4 referring back in unit 2 to [Bill] On the tree CDT1 the 4 same links are found, although now unit 2, not being on the vein of unit 4, the reference Bill from 4 to 2 is of an inference nature, still plausible (being realised through a proper noun) The overall smoothness scores of the trees CDT0 and CDT1, 44 are compared (6 – on the sequence 1 2 3 4 as in Table 1, versus 7 – on the sequences 1 2 and 1 3 4 as in Table 2) and the best is retained: CDT1 4 The last step exploits the cue-word therefore of unit 5 There are three places on the right frontier of the CDT1 where the adjoining of the auxiliary tree headed by a 4 CONCLUSION relation could be operated (marked 0, 1 and 2 on Fig 6) As CueExp gives no indication as to where to place the aux tree, three final trees will be obtained A possible rating is done in correlation with the resolution of the pronoun he and the definite noun the money On CDT0 vein(5) = 1 3 4 5, on CDT1 vein(5) = 1 3 5 and on 55 CDT2 vein(5) = 1 5 5 CDT0 where he in 5 is [Bill] is a rather strange interpretation where the money are 5 those Bill intended to give for buying John’s bike, and because they were not enough he decided to use them for going in a trip CDT1 gives an interpretation where the 5 decision of John to use the money for going in a trip came as a result of John obtaining a good price for his bike, and is inferable that the money are the money John obtained for the bike CDT2 evokes the interpretation in which the decision to go in a trip is 5 only a result of John selling the bike The money are again those obtained for the bike The CT transitions rates for the three interpretations are given in the following tables: Table 3 Transitions corresponding to CDT0 and he in 5 = [Bill] 5 1 2 1 3 4 5 CF J, b B, b J, b J, p, b p, B B, m, t CB J b J J p B Trans ASH CON SSH SSH Score 1 4 2 2 Global 9/4 = 2 25 Table 4 Transitions corresponding to CDT0 and he in 5 = [John] 5 1 2 1 3 4 5 CF J, b B, b J, b J, p, b p, B J, m, t CB J b J J p - Trans ASH CON SSH no Cb Score 1 4 2 0 Global 7/4 = 1 75 Table 5 Transitions corresponding to CDT1 and he in 5 = [John] 5 1 2 1 3 4 1 3 5 CF J, b B, b J, b J, p, b p, B J, b J, p, b J, m, t CB J b J J p J J J Trans ASH CON SSH CON CON Score 1 4 2 4 Global 11/4 = 2 75 Table 6 Transitions corresponding to CDT2 and he in 5 = [John] 5 1 2 1 3 4 1 5 CF J, b B, b J, b J, p, b p, B J, b J, m, t CB J b J J P J J Trans ASH CON SSH CON Score 1 4 2 4 Total 11/5 = 2 75 The scores given in these tables are used to rate the trees obtained The final trees of the analysis are thus CDT1 and CDT2, very close to the ones in Fig 1 55 4 Conclusions In this paper we revise Veins Theory – a theory of discourse cohesion and coherence and show how it can be used in an architecture aimed at building discourse structure simultaneously with resolving anaphorae The discourse units are revealed from raw text by a collection of heuristics that rely on cue words Units are incrementally attached to the developing tree on the right frontier By computing the vein expression and the domain of referential accessibility of the current unit, the search for antecedents of anaphorae belonging to this unit is restricted The resolution of anaphorae permits determination of centering transitions using the hierarchical adjacency metric given by veins Transitions are then used to restrain the exponential explosion of the obtained trees by sorting the trees in the descending order of a global smoothness score and retaining only the most relevant ones for further processing Further work is necessary in order to establish the generosity of this approach compared to others (as the one in , for instance, that is very well suited for learning parser's moves from a corpus) Automatic learning of heuristics for splitting the discourse in units from an annotated corpus is equally possible in our approach It remains to establish the feasibility of applying learning techniques for acquiring heuristics that would indicate shapes of auxiliary trees and the places of their adjunction on the developing tree References 1 Brennan, S E , Walker Friedman, M , Pollard, C J : A centering approach to pronouns Proc of the 25th Annual Meeting of ACL, Stanford (1987) 2 Cristea, D , Ide, N , Marcu, D , Tablan, M V : Discourse structure and Co- Reference: An Empirical Study In: Proceedings of the Workshop Relations Between Discourse Structure and Reference, ACL Maryland (1999) 3 Cristea, D , Ide, N , Romary, L : Veins Theory - A Model of Global Discourse Cohesion and Coherence In: Proc of Coling/ACL’98, Montreal (1998) 4 Cristea, D Webber, B L : Incremental expectations in discourse parsing In: Proc of ACL/EACL, Madrid (1997) 5 Gordon P C , Hendrick R : The representation and processing of coreference in discourse In: Cognitive Science, 22 (1998) 6 Grosz, B J ; Joshi, A , Weinstein, S : Centering – a framework for modelling the local coherence of discourse In: Computation Linguistics, 12(2), June (1995) 7 Grosz, B , Sidner, C : Attention, intention and the structure of discourse In: Computational Linguistics, 12 (1986) 8 Mann, W C , Thompson S A : Rhetorical structure theory: A theory of text organization In: Text 8:3 (1988) 9 Marcu, D : A Decision-Based Approach to Rhetorical Parsing In: Proceedings of the ACL'99 (1999) 10 Mitkov, R : Robust Pronoun Resolution with Limited Knowledge In: Proceedings of COLING-ACL'98, Montreal (1998) 11 Shabes, Y : Mathematical and Computational Aspects of Lexicalized Grammars, Technical Report MS-CIS-90-48, LINC LAB 179 (1990) 12 Tablan, M V , Barbu, C , Popescu, H , Hamza, R O , Ciobanu, C , Ni••, I C , Boc•nial•, C D , Georgescul, M , Cristea D : Co-opertion and Detachement in Discourse Understanding In: Proc of the Workshop on Lexical Semantics and Discourse, ESSLLI’98, Saarbruecken (1988) 13 Walker M A : Centering, Anaphora Resolution and Discourse Structure In Walker, M A ; Joshi, A K ; Prince, E F (eds ): Centering in Discourse, Oxford University Press (1997) 